Manipulating measurement scales in medical statistical analysis and data mining: A review of methodologies
Abstract
BACKGROUND: Selecting the correct statistical test and data mining method depends heavily on the measurement scale of the data, the type of variables, and the purpose of the analysis. Different measurement scales are reviewed in detail, and statistical comparison, modeling, and data mining methods are discussed using several medical examples. We present two clustering examples for ordinal variables, a more challenging variable type to analyze, using the Wisconsin Breast Cancer Data (WBCD). ORDINAL-TO-INTERVAL SCALE CONVERSION EXAMPLE: A breast cancer database of nine 10-level ordinal variables for 683 patients was analyzed with two ordinal-scale clustering methods. The performance of the clustering methods was assessed by comparison with the gold-standard groups of malignant and benign cases, which had been identified by clinical tests. RESULTS: The sensitivity and accuracy of the two clustering methods were 98% and 96%, respectively. Their specificity was comparable. CONCLUSION: Choosing a clustering algorithm appropriate to the measurement scale of the study variables can yield high performance. Moreover, descriptive and inferential statistics, as well as the modeling approach, must be selected based on the scale of the variables.
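The comparison of cluster assignments with gold-standard diagnoses described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code: the function name `diagnostic_metrics`, the cluster-to-class mapping, and the ten toy records are all hypothetical, and the numbers it produces are unrelated to the WBCD results reported above. It assumes each cluster has already been mapped to one diagnostic class and that both classes occur in the gold standard.

```python
def diagnostic_metrics(gold, predicted, positive="malignant"):
    """Compare predicted labels with gold-standard labels.

    Assumes both classes occur in `gold`, so no denominator is zero.
    """
    tp = fp = tn = fn = 0
    for g, p in zip(gold, predicted):
        if g == positive and p == positive:
            tp += 1  # malignant case placed in the malignant cluster
        elif g == positive:
            fn += 1  # malignant case missed
        elif p == positive:
            fp += 1  # benign case placed in the malignant cluster
        else:
            tn += 1  # benign case placed in the benign cluster
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "accuracy": (tp + tn) / (tp + tn + fp + fn),
    }


# Hypothetical toy data (NOT the WBCD): gold-standard diagnoses and cluster ids.
gold = ["malignant"] * 4 + ["benign"] * 6
cluster_ids = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]

# Assumed mapping from cluster id to diagnostic class.
cluster_to_class = {1: "malignant", 0: "benign"}
predicted = [cluster_to_class[c] for c in cluster_ids]

metrics = diagnostic_metrics(gold, predicted)
print(metrics)
```

Sensitivity is the share of gold-standard malignant cases that fall in the malignant cluster, specificity is the share of benign cases that fall in the benign cluster, and accuracy is the share of all cases assigned correctly.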
Similar resources
Credit Card Fraud Detection using Data mining and Statistical Methods
Due to today’s advancement in technology and businesses, fraud detection has become a critical component of financial transactions. Considering vast amounts of data in large datasets, it becomes more difficult to detect fraud transactions manually. In this research, we propose a combined method using both data mining and statistical tasks, utilizing feature selection, resampling and cost-...
Analysis of Pre-processing and Post-processing Methods and Using Data Mining to Diagnose Heart Diseases
Today, a great deal of data is generated in the medical field. Acquiring useful knowledge from this raw data requires data processing and detection of meaningful patterns and this objective can be achieved through data mining. Using data mining to diagnose and prognose heart diseases has become one of the areas of interest for researchers in recent years. In this study, the literature on the ap...
Fatigue Assessment Scales: A comprehensive literature review
Background & Aims of the Study: Fatigue is one of the most important issues relating to safety and other aspects of human life. To understand fatigue and its related factors and causes, there is a need for useful instruments, such as self-reported scales. The purpose of this study is to identify and present useful self-reported scales to measure fatigue. Materials & Methods: Data were extrac...
Evaluation of Sensory Pathways in Spinal Cord by Comparison of fMRI Methodologies
Introduction: Today, clinicians and neuroscientists need to have a comprehensive survey of neurological pathologies and injuries. For the first time, SEEP contrast and spin-echo pulse sequences were used for functional imaging of the lumbar spinal cord. This method has been used by several research groups for spinal cord mapping, but other researchers tried to improve BOLD fMRI to Spina...
Credit scoring in banks and financial institutions via data mining techniques: A literature review
This paper presents a comprehensive review of the work done during 2000–2012 on the application of data mining techniques in credit scoring. Yet there isn’t any literature in the field of data mining applications in credit scoring. Using a novel research approach, this paper investigates academic and systematic literature review and includes all of the journals in the Science direct onli...